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ON THE MASAMI YASUDA STOPPING GAME 

KRZYSZTOF J. SZAJOWSKI 

^^* Abstract. The sero-sum stopping game for the stochastic sequences has been 

1*—^ formulated in late sixties of the twenty century by Dynkin [S] . The formulation 

rvl had the assumption about separability of decision moment of the players which 

f . simplified the construction of the solution. Further research by Neveu [22j 

^1^ extended the model by admitting more general behaviour of the players and 

•^r their pay-offs. In new formulation there is the problem with existence of 

I _,^ the equilibrium. The proper approach to solution of the problem without 

fvj restriction of former models was developed by Yasuda [H]. The results was 

crucial in these research. The author made often reference to the Yasuda's [33] 
result in his works (see [36l|37l[33) as well as see results of others stimulated 
by this paper. Withal, in this note another stopping game model, developed 
Mh by Yasuda with coauthors (see e.g. jTJ and 0^) is recalled. The application 

,^ of the model to an analysis of system of detectors shows the power of the game 

j^ theory methods. 

Cj In the last part of the paper I would like to express my personal relation to 

I the Masami Yasuda game. 

> 

29 1- Introduction 

o\ 

^ The mathematical modehing of economic and engineering systems in stochas- 

^ tic environment leads to various mathematical optimization and game theory 

^T) problems. If the decision problem relies on choice of intervention moments one 

^—1 can formulate the model of such case as the optimal stopping problem. If it is 

>- allowed to react more than once the approach depends on the number of deci- 

'k> sion makers and their aims. If there is one decision maker and two reactions 

;_i (or fix number of possible moment of actions) we have the optimal two stopping 

(multiple stopping) problem. When there are two decision makers with their 

prescribed aims we usually treat the problem as the stopping game. The related 

models bring very subtle mathematical questions concerning the correctness of 

the model, possibility of inference about rational strategies, their realization and 

the existence of solution in the formulated mathematical model. In this note 

I would like to focus our attention of two group of models not very precisely 
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2 K. J. Szajowski 

defined. The first one is related to the existence of solution under mild as- 
sumption on the processes defining the payoffs in the zero-sum stopping game 



related to problem introduced by Dynkin [5] (see the section 1.1). The second 
group of the problems, which I have applied recently to modelling the sensor net- 
works, developed by Yasuda with co-authors (see e.g. [Mj and jlQ]), is devoted 
to multivariate stopping problem when there are decision makers having some 



interactions between each others (see the section 1.2). 



1.1. The randomize strategies in Dynkins game. E. B. Dynkin [5] pre- 
sented the following problem: two players observe a stochastic sequence X„, 
n = 1,2,.... Each of them chooses a stopping time, say A (resp. /i) be the 
stopping time chosen by the first (resp. the second) player. It is additionally 
assumed that the first player can stop at odd and the second at even moments. 
The pay-off is then: R{X, fi) = EXaa/^- The player 1 seeks to maximize the ex- 
pected pay-off, and the player 2 seeks to minimize it, it means that the solution 
is a pair (A*,/i*) such that 

(1.1) i^(A,/i^)<(A^/x^)<(A^/i). 

J. Neveu [22] modified this problem as follows: there are three random se- 
quences {Xn,J^n), {yn,J^n) and (Wn, J^n) with 

Assumptions 1.2. 

(1.3) Xn<Wn< Yn foT cach n G N, 
and the pay-off equals: 

(1.4) R{\ fi) = E{XaI{a<^} + Wxl{x=,} + i;i{^<A}}, 

where A, /i G © are stopping times with respect to J-'^.This problem has solution 
which is presented in [22j. 

When the assumption L2 is not fulfilled then, in general there are no equilib- 
rium in the set of stopping times with respect of observed processes {Xn,J^n), 
{Yn,J^n) and (W„, J-'n). It is M. Yasuda who shown in [33] that the mixed ex- 



tension of this game has equilibrium without the assumption |1.2[ The mixed 
extension in this case means that the set of strategies (stopping times) is ex- 
tended to include randomized stopping time. First, a finite horizon problem is 
considered. Next, the existence of the value and the equilibrium point in the 
infinite horizon problem with a discount factor is proved under some natural 
assumption concerning the integrability of the considered processes. 

In that time there were many mathematicians doing research in stopping game 
(see e.g. the papers by Zabczyk [50], Stettner [M], Ohtsubo [23]). The Yasuda's 
paper [il] stimulated further research of Rosenberg, Solan and Vieille [28] and 
Laraki and Solan [T7] in the mixed extension of the stopping game for the pro- 
cesses with continuous parameter. 
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1.2. The stopping processes by voting procedure. Let us consider p person 
stopping game related to tlie observation of a Markov cliain. Let {Xn,dn,Px), 
n = 0, 1,2, . . ., be a liomogeneous Markov chain defined on a probability space 
(f2,5^, P) with state space (E, -B). The players are able to observe the Markov 
chain sequentially. At each moment n their knowledge is represented by ^n- 
Each player has his own utility function /j : E — )■ 3f?, -i = 1, 2, . . . ,p, and at each 
moment n each player declares separately his willingness to stop the observation 
of the process. The effective ends of the process and realization of the payoffs 
appears when a suitable subset of players agree to it. The aim of each player is 
to maximize their expected payoffs. In fact, the problem will be formulated as 
a p person non-cooperative game with the concept of Nash equilibrium [21] as 
the solution. On the other hand, one can say that the considered multilateral 
stopping procedure is based on sequential voting (cf [7], [10], [13] for monotone 
rule concept and the mathematics of voting). 

Such model has been considered in mine and Yasuda paper [30] and Fergu- 
son [H]. Both papers were continuation of Masami Yasuda and his co-workers, 
Kurano and Nakagami research published in [13], [3S], [3S]' P^- They have 
investigated the multilateral version of the optimal stopping problem for inde- 
pendent, identically distributed p dimensional random vectors X„. The gain 
function of the i-th player is X^ {i-th coordinate of X„). In \2A\ the following 
class of strategies is used. 

(1) Each player can declare to stop at any stage. 

(2) The majority level r {1 < r < p) is chosen by the players at the beginning 
of the game. 

(3) During the sequential observation process, if the number of players declar- 
ing to stop is greater than or equal to the level r, the process must be 
stopped. 



This class of strategies is generalized in [36] to monotone rules. Roughly speaking, 
a monotone rule is a p variate, non-decreasing logical function defined on {0, 1}^. 
In both papers the problem is formulated as a p person, non-cooperative game 
with concept of Nash point as a solution. Paper [13] generalizes the unanimity 
case, i.e. p = r solved by Sakaguchi [21]. The motivation for the model considered 
is the secretary problem (see [S] for the formulation of the problem). A solution 
of some bivariate version of the secretary problem is given in [1A\. Presman and 
Sonin [26] treat this problem with another set of strategies. They considered the 
model in which each player's decision does not affect the stopping of the process 
but his reward only. Sakaguchi [30] and Kadane [12] have solved a multilateral 
sequential decision problem in which decisions whether to stop are made by the 
players alternately, instead of simultaneous decision under a monotone rule. 
The recent paper on the voting stopping problem are 
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2. Sensors' network and stopping games 

In [39] the construction of the mathematical model for a multivariate surveil- 
lance system is presented. It is assumed that there is net 9T of p nodes which 
register (observe) signals modeled by discrete time multivariate stochastic pro- 
cess. At each node the state is the signal at moment n G N which is at least 
one coordinate of the vector Xn G E C 'Si'^. The distribution of the signal 
at each node has two forms and depends on a pure or a dirty environment of 
the node. The state of the system change dynamically. We consider the dis- 
crete time observed signal as m > p dimensional process defined on the fixed 
probability space {Q,J-',P). The observed at each node process is Markovian 
with two different transition probabilities (see [31] for details). In the signal the 
visual consequence of the transition distribution changes at moment 6i, i & ^ 
is a change of its character. To avoid false alarm the confirmation from other 
nodes is needed. The family of subsets (coalitions) of nodes are defined in such 
a way that the decision of all member of some coalition is equivalent with the 
claim of the net that the disorder appeared. It is not sure that the disorder 
has had place. The aim is to define the rules of nodes and a construction of 
the net decision based on individual nodes claims. Various approaches can be 
found in the recent research for description or modelling of such systems (see e.g. 
|42] . [2?!). The problem is quite similar to a pattern recognition with multiple 
algorithm when the fusions of individual algorithms results are unified to a final 
decision. The proposed solution will be based on a simple game and the stopping 
game defined by a simple game on the observed signals. It gives a centralized, 
Bayesian version of the multivariate detection with a common fusion center that 
it has perfect information about observations and a priori knowledge of the sta- 
tistics about the possible distribution changes at each node. Each sensor (player) 
will declare to stop when it detects disorder at his region. Based on the simple 
game the sensors' decisions are aggregated to formulate the decision of the fusion 
center. The sensors' strategies are constructed as an equilibrium strategy in a 
non-cooperative game with a logical function defined by a simple game (which 
aggregates their decision). 

This approach uses the general description of such multivariate stopping games 
presented in the section [L2| The voting aggregation rules are relieved by the sim- 
ple game (see Ferguson [6J ) and the underlining processes form Markov sequences 
(see [10]). 

The model of disorder detection at each sensor are presented in the next sec- 
tion. It allows to define the individual pay-off of the players (sensors). It is 
assumed that the sensors are distributed in homogeneous way in the guarded 
area and the intruders behaviour are well modelled by symmetric random walk. 
By these assumptions in the section [3] the a priori distribution of the disorder 
moment at each node can be chosen in such a way that it gives the best model of 
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the structure of sensors and the behaviour of intruder . The section H] introduces 
the aggregation method based on a simple game of the sensors. The section |5] 
contains derivation of the non-cooperative game and existence theorem for equi- 
hbrium strategy. The final decision based on the state of the sensors is given by 
the fusion center and it is described in the section 16.11 The natural direction of 
further research is formulated also in the same section. A conclusion and resume 
of an algorithm for rational construction of the surveillance system is included 
in the section 16.21 

The extension of non-cooperative games to the case when the communication 
between player is allowed leads to various solutions concepts. The voting stopping 
game is interesting approach also in this direction of research. 



3. Detection of disorder at sensors 

Following the consideration of Section [l} let us suppose that the process 
{Xn-,n G N}, N = {0,1,2,...}, is observed sequentially in such a way that 

each sensor, e.g. r (gets its coordinates in the vector A„ at moment n). By 
assumption, it is a stochastic sequence that has the Markovian structure given 
random moment 6r, in such a way that the process after 6r starts from state 
Xne,.-!- The objective is to detect these moments based on the observation 
of A „ . at each sensor separately. There are some results on the discrete time 
case of such disorder detection which generalize the basic problem stated by 
Shiryaev in ^2] (see e.g. Brodsky and Darkhovsky |2j, Bojdecki [Ij, Poor and 
Hadjiliadis [2S], Yoshida [IH], Szajowski |3S]) in various directions. 

Application of the model for the detection of traffic anomalies in networks has 
been discussed by Tartakovsky et al. [Hj. The version of the problem when the 
moment of disorder is detected with given precision will be used here (see [31]). 



TieN 



3.1. Formulation of the problem. The observable random variables {X„} 

are consistent with the filtration J-'„ (or J-'n = cr{Xo, A i, . . . , X„)). The random 

vectors A„ take values in (E, i3), where E C 3f?™. On the same probability 
space there are defined unobservable (hence not measurable with respect to J-'n) 
random variables {6r}^=i which have the geometric distributions: 

(3.1) P(0, = j) = pi.~\r, g. = 1 - P. G (0, 1), J = 1, 2, . . .. 

The sensor r follows the process which is based on switching between two, time 
homogeneous and independent, Markov processes {X*„}„gN, z = 0, 1, r e OT with 
the state space (E, i3), both independent of {6',,}^^. Moreover, it is assumed 
that the processes {X*^}„gp^ have transition densities with respect to the cr-finite 
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measure /x, i.e., for any B E B we have 

(3.2) PiiXt.eB) = P{Xi.,eB\Xi.o = x)= [ f:\y)fiidy). 

Jb 

The random processes {Xm}, {^rn}^ i-^rn} ^^^ ^^^ random variables 9r are 
connected via the rule: conditionally on 6^ = k 

X = f ^rn^ iik>n, 

where {X^„} is started from X^ i^_-^ (but is otherwise independent of X^ ,). 

For any fixed dr G {0, 1, 2, . . .} we are looking for the stopping time t* E T 
such that 

(3.3) P,i\9r - r;| < dr) = sup P,(|^, - r| < dr) 

where G'^ denotes the set of all stopping times with respect to the filtration 
{^n}n<m- The parameters dr determines the precision level of detection and it 
can be different for too early and too late detection. These payoff functions 
measure the chance of detection of intruder. 

3.2. Construction of the optimal detection strategy. In [3T] the construc- 
tion of T* by transformation of the problem to the optimal stopping problem for 

the Markov process ^ has been made, such that i m = {X_r n-i-dr ni ^n)-, where 

Arn-i-d^,n = (^rn-i-d^, ■ ■ ■ ,X r v) and Jlrn is the posterior process: 

n.o = 0, 

n™ = Px {QT<n\7r^, ra = 1, 2, . . . 

which is designed as information about the distribution of the disorder instant Qr- 
In this equivalent the problem of the payoff function for sensor r is hr{Xr dr+2, «)• 

4. The aggregated decision via the cooperative game 

There are various methods combining the decisions of several classifiers or sen- 
sors. Each ensemble member contributes to some degree to the decision at any 
point of the sequentially delivered states. The fusion algorithm takes into account 
all the decision outputs from each ensemble member and comes up with an en- 
semble decision. When classifier outputs are binary, the fusion algorithms include 
the majority voting [15], [16], naive Bayes combination [3], behavior knowledge 
space [H], probability approximation [13] and singular value decomposition [18] . 

The majority vote is the simplest. The extension of this method is a simple 
game. 
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4.1. A simple game. Let us assume that there are many nodes absorbing infor- 
mation and make decision if the disorder has appeared or not. The final decision 
is made in the fusion center which aggregates information from all sensors. The 
nature of the system and their role is to detect intrusion in the system as soon 
as possible but without false alarm. 

The voting decision is made according to the rules of a simple game. Let us 
recall that a coalition is a subset of the players. Let C = {C : C C ^} denote 
the class of all coalitions. 

Definition 4.1. (see [21], [6]) A simple gam,e is coalition game having the char- 
acteristic function, 0(-) : C — )■ {0, 1}. 

Let us denote W = {C C 01 : ^(C) = 1} and £ = {(7 C 01 : 0(C) = 0}. The 
coalitions in W are called the winning coalitions, and those from C are called 
the losing coalitions. 

Assumptions 4.2. By assum,ption the characteristic function satisfies the prop- 
erties: 

(1) 01 e W; 

(2) G C; 

(3) (the monotonicity): T G S E C implies T E C. 

4.2. The aggregated decision rule. When the simple game is defined and the 
players can vote presence or absence, Xj = 1 or Xj = 0, i G 01, of the intruder 
then the aggregated decision is given by the logical function 

(4.3) 7r(a;i, Xs, . . . , Xp) = ^^ JJ a;^ JJ(1 - Xj). 

For the logical function vr we have (cf |16]) 



I I 



7r(x\ . . . , x^) = x' ■ 7r(x\ . . . ,1, . . . ,x^) + x' ■ 7r(x\ . . . , 0, . . . , x^). 
5. A NON-COOPERATIVE STOPPING GAME 

Following the results of the author and Yasuda [40j the multilateral stopping 
of a Markov chain problem can be described in the terms of the notation used 
in the non-cooperative game theory (see [21], [1], [19], [21]). Let (A„,5'n,Px), 
n = 0, 1, 2, . . . , A^, be a homogeneous Markov chain with state space (E, B). The 
horizon can be finite or infinite. The players are able to observe the Markov chain 
sequentially. Each player has their utility function fi : E — )■ 9f?, i = 1,2, ... ,p, 
such that Ej.|/j(Ai)| < oo. If process is not stopped at moment n, then each 
player, based on ^n, can declare independently their willingness to stop the 
observation of the process. 
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Definition 5.1. (see [16]) An individual stopping strategy of the player i (ISS) 
is the sequence of random variables {al^}^^^, where a^ : fi — )■ {0, 1}, such that 
o"n is S^n-measurable. 

The interpretation of the strategy is following. If cr^ = 1 then player i declares 
that they would like to stop the process and accept the realization of X„. Denote 
cr* = {a{, al, . . . , a]y) and let & be the set of ISSs of player i, i = 1,2, ... ,p. 
Define 

6 = ©1 X ©2 X ... X ©P. 

The element a = (a^, cr^, . . . , a'^)^ G & will be called the stopping strategy (SS). 
The stopping strategy a G © is a random matrix. The rows of the matrix are the 
ISSs. The columns are the decisions of the players at successive moments. The 
factual stopping of the observation process, and the players realization of the 
payoffs is defined by the stopping strategy exploiting p-variate logical function. 
Let vr : {0, 1}^ — )■ {0, 1}. In this stopping game model the stopping strategy 
is the list of declarations of the individual players. The aggregate function vr 
converts the declarations to an effective stopping time. 

Definition 5.2. A stopping time in{cr) generated by the SS a G © and the 
aggregate function tt is defined by 

t.(a) = inf {1 <n<N: nia^ al, . . . , <) = 1} 

(inf(0) = oo). Since vr is fixed during the analysis we skip index vr and write 
t((T) = t,(a). 

We have {w G fi : i^{a) = n} = {Xklli^ ^ ^ ■ ^(^L ^L • • • . O = 0} n {w G 
fi : 7r(cr^, a^, . . . , a^) = 1} G 5^„, then the random variable t^(o") is stopping time 
with respect to {dn}n=i- ^^^ ^^Y stopping time t7r(cr) and i G {1, 2, . . . ,p}, let 

r(Y ^ _ j fiiXn) if t^(cr) = U, 

UK^i^i.)} I hmsup,^^ /,(X„) if t.(a) = oo 

(cf [33], [IQ]). If players use SS cr G (5 and the individual preferences are con- 
verted to the effective stopping time by the aggregate rule vr, then player i gets 

Let V = (V\ V2, . . . , *aPY be fixed SS. Denote 

V(z) = (V\ . . . , V-\ a\ V+\ . . . , V^)^. 

Definition 5.3. (cf. [ID]) Let the aggregate rule vr be fixed. The strategy 
*cr = (*(T^,*cr^, . . . ,*cr^)^ G © is an equilibrium strategy with respect to vr if for 
each i G {1, 2, . . . ,p} and any cr* G & we have 

(5.4) Ea./j(At^(.^)) > Ea./;,(At^(v(i))). 
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The set of SS &, the vector of the utihty functions / = (/i, /2, . . . , fp) and the 
monotone rule vr define the non-cooperative game Q = ((5,/,7r). The construction 
of the equihbrium strategy *a E & in Q is provided in [IQ]. For completeness 
this construction will be recalled here. Let us define an individual stopping set 
on the state space. This set describes the ISS of the player. With each ISS of 
player i the sequence of stopping events D^ = {w : cr^ = 1} combines. For each 
aggregate rule vr there exists the corresponding set value function 11 : 5^ — )■ 5^ 
such that 7r(a^, a^, . . . , cr^) = 7r{lBi , Ii)2 , . . . , I^p} = lu{Di,Di,...,DPy For solution 
of the considered game the important class of ISS and the stopping events can 
be defined by subsets C* G ^B of the state space E. A given set C^ E B will be 
called the stopping set for player i at moment n if Dl^ = {u : X„ G C*} is the 
stopping event. 

For the logical function vr we have 

i i 

7r(x\ . . . , x^) = X* ■ vr(x\ . . . , 1, . . . , x^) + x* ■ 7r(x\ . . . , 0, . . . , x^). 
It implies that for D^ E ^ 

i 

^^^^^ u{D\...,Dn= {D'nuiD\...,n,...,DP)} 

u{D'nu{D\...,l---,Dn}- 

Let fi, Qi be the real valued, integrable (i.e. Fjx\fi{Xi)\ < oo) function defined 
on E. For fixed D^, j = 1,2, . . . ,p, j j^ i, and C E B define 

where 'Di{A) = U{D\, ..., D\-\ A, D\-^\ ...,D'{) and D{ = {co : X^ E C'}. Let 
a"*" = max{0, a} and a~ = min{0, —a}. 

Lemma 5.6. Let fi, gi, he integrable and let C^ E B, j = 1,2, . . . ,p, j ^ i, be 
fixed. Then the set *C* = {x G E : /j(x) — 5'j(x) > 0} G i3 zs such that 

iP{*C') = sup 4j{C') 
C'eB 

and 

(5.7) iji*C') = E,{f,{X,) - g,iX,)yhn,in) 

-E^if^{X^) - g,iX^))-hn,in) + E^X,). 



Based on Lemma |5.6| we derive the recursive formulae defining the equilibrium 
point and the equilibrium payoff for the finite horizon game. 
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5.1. The finite horizon game. Let horizon A^ be finite. If the equihbriuni 

strategy V exists, then we denote fj,Ar(x) = Eixfi{Xt(*a)) the equilibrium payoff 
of i-th player when Xq = x. For the backward induction we introduce a useful 
notation. Let ©^ = {{0"^}, k = n, . . . , N} be the set of ISS for moments n < 
k < N and ©„ = (5^ x (5^ x . . . x ©^. The SS for moments not earlier than n 
is V = (VS V2, . . . , "orP) G &n, where V = (<, <+i, ...,<)• Denote 

tn = tnia) = t(V) = inf {n < k < N : 7t{al al . . . , al) = 1} 

to be the stopping time not earlier than n. 

Definition 5.8. The stopping strategy '^*a = {^*cr^,"-*a'^, . . . ,"'*aP) is an equilib- 
rium in (5 „ if 

^xfi{Xt„{*a)) > E^'/j(-^t„{'a(i))) Px - a-e. 
for every i G {1, 2, . . . ,p}, where 

Denote 

At moment n = N the players have to declare to stop and Vifi{x) = fi{x). Let us 
assume that the process is not stopped up to moment n, the players are using the 
equilibrium strategies V^, z = 1, 2, . . . ,p, at moments k = n + 1, . . . ,N. Choose 
player i and assume that other players are using the equilibrium strategies V^, 
j 7^ i, and player i is using strategy a^ defined by stopping set C\ Then the 
expected payoff (y9Ar_„(X„„i, C*) of player i in the game starting at moment n, 
when the state of the Markov chain at moment ra — 1 is X„_i is equal to 



where ^*Dn{A)=Ii{*Dl . . . , *Dri, A, *D^^\ .■■,*Dl). 



By Lemma 5.6 the conditional expected gain (y97v-n.(-^Af-n, C**) attains the max- 



imum on the stopping set *C^ = {x G E : /j(x) — Vi^N-n{,x) > 0} and 

(5.1) -Ea;[(/i(X„) -t;j,Af-n(-^n))"I"D„(0)|5n-l] 

Pj;— a.e.. It allows to formulate the following construction of the equilibrium 
strategy and the equilibrium value for the game Q. 

Theorem 5.2. In the game Qwith finite horizon N we have the following solu- 
tion. 
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(i): The equilibrium value Vi{x), i = 1,2, . . . ,p, of the game Q can be cal- 
culated recursively as follows: 

(1) Vifl{x) = fi{x); 

(2) For n = 1,2, . . . ,N we have Px—a.e. 

-^xiifii^N-n+l) - 'yi,n-l(^Ar-n.+l))"I"Djv-„+i(0)l5'^-™] 

+'Eix[Vi,n-l{XN-n+l)\dN-n], 

fori = 1,2,..., p. 
(ii): The equilibrium strategy *a E & is defined by the SS of the players *al^, 
where *ai^ = 1 if Xn e *Q, and *Cl = {x e E : fi{x) - Vi^N~n{x) > 0}, 
n = 0,l,...,N. 

We have Vi{x) = Vi^jyix), and E^fi{Xt(^,„)) = fi,Ar(x), i = l,2,...,p. 

6. Infinite horizon game 



In this class of games the equihbrium strategy is presented in Definition |5.3 
but in class of SS 

&} = {a ee* : E.Jr{Xt(^^)) < oo for every x eE, i = 1,2, . . . ,p}. 

Let V G &*j- be an equilibrium strategy. Denote 

Vi{x) = E^Ji{Xt(*a))- 

Let us assume that ("+^)*o- g ©/•„+! is constructed and it is an equilibrium 
strategy. If players j = 1,2, . . . ,p, j ^ i, apply at moment n the equilibrium 
strategies *a^^ , player i the strategy crj^ defined by stopping set C* and ('^+^)V at 
moments n + l,n + 2, . . ., then the expected payoff of the player i, when history 
of the process up to moment n — 1 is known, is given by 



(Pn{Xn-l, C) — Ex„_i fi{Xn)l^*D^(^Dl;) + ^^i(-^n)I»D„{Dj; 



where "D„(A) = U{*Dl ..., *Dl\ A, *D'+\ ■■■,*DpJ, *Di = {u e Q : *ai = 1}, 
j = l,2,...,p,j^t,andDl^ = {uen:al, = l} = l} = {uen:X^e O}. By 



Lemma 5.6 the conditional expected gain y9„(X„_i, C*) attains the maximum on 
the stopping set *Q = {x G E : /j(x) > Vi{x)} and 

-E,[(/,(X„) - i;i(XO)-I"D„(0)|^n-l] 
+E.,[Vi{Xr,)\^a-l]. 
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Let US assume that there exists solution {wi{x),W2{x), . . . ,Wp{x)) of the equa- 
tions 

(6.1) Wi{x) = E,{fi{X,) - Wi{X,))n,,D,in) 

-E,(/,(Xi) - w,{X^))-h,D,w + E,Wi{X,), 

i = 1,2, ... ,p. Consider the stopping game with the following payoff function 
for 2 = 1, 2, ... ,p. 

' Mx) iin<N, 



'^^•^(^)- 'I Vi{x) iin>N. 

Lemma 6.2. Let *cr g &*f be an equilibrium strategy in the infinite horizon game 
Q . For every N we have 

Let us assume that for z = 1, 2, . . . ,p and every x G E we have 
(6.3) E^[sup„gp^/+(X„)] < oo. 

Theorem 6.4. Let (X„, 5^„, 'Px)^=o be a homogeneous Markov chain and the pay- 
off functions of the players fulfill loi). If t* = t{*o), *a G 6j then E^fi{Xt*) = 



Vi{x). 

Theorem 6.5. Let the stopping strategy *a G &*f be defined by the stopping sets 
*C^ = {a; G E : /j(x) > Vi{x)}, i = 1,2, . . . ,p, then *a is the equilibrium strategy 
in the infinite stopping game Q. 

6.1. Determining the strategies of sensors. Based on the model constructed 
in Sections |3]-[5] for the net of sensors with the fusion center determined by a sim- 
ple game, one can determine the rational decisions of each nodes. The rationality 
of such a construction refers to the individual aspiration for the highest sensitivity 
to detect the disorder without false alarm. The Nash equilibrium fulfills require- 
ment that nobody deviates from the equilibrium strategy because its probability 
of detection will be smaller. The role of the simple game is to define wining 
coalitions in such a way that the detection of intrusion to the guarded area is 
maximal and the probability of false alarm is minimal. The method of construct- 
ing the optimum winning coalitions family is not the subject of the research in 
this article. However, there are some natural methods of solving this problem. 

The research here is focused on constructing the solution of the non-cooperative 
stopping game as to determine the detection strategy of the sensors. To this end, 
the game analyzed in Section [5] with the payoff function of the players defined 
by the individual disorder problem formulated in Section [3] should be derived. 

The proposed model disregards correlation of the signals. It is also assumed 
that the fusion center has perfect information about signals and the information 



On the Masami Yasuda stopping game 



13 



is available at each node. The further research should help to qualify these real 
needs of such models and to extend the model to more general cases. In some 
type of distribution of sensors, e.g. when the distribution of the pollution in the 
given direction is observed, the multiple disorder model should work better than 
the game approach. In this case the a priori distribution of disorder moment 
has the form of sequentially dependent random moments and the fusion decision 
can be formulated as the threshold one: stop when k* disorder is detected. The 
method of a cooperative game was used in [S] to find the best coalition of sensors 
in the problem of the target localization. The approach which is proposed here 
shows possibility of modelling the detection problem by multiple agents at a 
general level. 

6.2. Final conclusion concerning the disorder detection system. In a 

general case the consideration of the paper [39j leads to the algorithm of con- 
structing the disorder detection system. 




Figure 1. Sudety Mountains. International Conference on Math- 
ematical Statistics STAT'2000. Szklarska Por§ba, Poland, August, 
2000 
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Figure 2. RIMS Conference 2002, Kyoto, Japan. From the left: 
the author of this note, professors Masami Yasuda, Vladimir V. 
Mazalov and Mitsushi Tamaki 



6.2.1. Algorithm. 

(1) Define a simple game on the sensors. 

(2) Describe signal processes and a priori distribution of the disorder mo- 
ments at all sensors. Establish the a posteriori processes: Tin = (Hin, . . . , n„ 
where Ukn = PiO < n\J^n)- 

(3) Solve the multivariate stopping game on the simple game to get the in- 
dividual strategies of the sensors. 



7. The contribution to the mathematical education, the 
scientific cooperation and the friendship 

It was 1994 when I came to Japan for the first time based on Prof. Minoru Sak- 
aguchi and Prof. Katsunori Ano invitation to take part in the International Con- 
ference on Stochastic Models and Optimal Stopping, Nanzan University, Nagoya. 
Since this event Professor Masami Yasuda is my guide in the mathematics and 
the Japanese culture. When the Internet connected the people we discussed 
the game model, which I call myself the Masami Yasuda game described in this 
note in the sections 11.21 and [2l Based on the discussion we have written the 
papers |10] and [IT]. Our meeting and discussion were in Poland (see Figure [I]), 
Japan (see Figure [2]) and Russia (see Figure |3|. Last year, during the one day 
workshop, the possible further research and academic cooperation was the topic 
of our discussion. Based on European Union integration processes the Polish 
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Figure 3. 13*^ Symposium of ISDG 2008, St. Petersburg, Russia 



educational system is under very intensive reconstruction process. The mathe- 
matical education of engineering faculties students is very fragile task. Professor 
Yasuda provides us his extensive academic experience by his contribution to our 
local conferences devoted to teaching mathematics for non-mathematical major 
students. Such organized events were in Ibaraki National College of Technology, 
Hitachinaka (2006) and Wroclaw Univeristy of Technology (2008). 
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